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1. INTRODUCTION 

In recent years, the capacity to differentiate between herbs pesticides-processed has attracted 
considerable attention from consumers due to the rates of poisoning and chronic illnesses caused by 
consuming contaminated food. This task is done using chemical materials like gas chromatography alone or 
coupled with mass spectrometry [1], analysis by gas chromatography coupled with olfactometry [2], or ion 
mobility spectrometry (IMS) [3]. These techniques are expensive and require a trip to the laboratory, hence 
the idea of simplifying the quality control process. Lately, the multi-sensor system is widely used to evaluate 
the quality of food, it is a non-destructive procedure using machine learning algorithms, this tool has been 
used by several researchers in the agro-processing industry, to control the quality of fruits [4], vegetable [5], 
meats [6], and fish [7]. It also makes it possible to identify counterfeits, for example for, meats, it makes it 
possible to distinguish mixtures of meats, mutton with pork [8], or adulterated beef with pork [9]. 

In electronics and especially multi-sensors systems, machine learning algorithms are a fundamental 
and crucial component. They are used to process large and complex data for the purpose of extracting useful 
information and to improve decision making. They are used for data classification as well as for regression. 
Machine-learning algorithms are supervised [10], unsupervised [11] or semi-supervised [12] but the best 
known and the most used in the classification are supervised algorithms such as decision tree (DT), naive 
Bayes, k-nearest neighbours algorithm (KNN), support vector machines (SVM), and some artificial neural 
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networks (ANN) algorithms, like multilayer perceptron (MLP) [13]. These methods try to establish a relation 
between the input and the given target output in the first phase of training (learning) to form the model and 
use it during the test to make decisions. For the regression we find linear regression, logistic regression, 
poisson regression, partial least squares regression (PLSR), support vector machines regression (SVMR) and 
back propagation artificial neural network (BP-ANN) [14], [15]. There isn’t a method that is effective for all 
the problems studied but each application requires carrying out several comparisons between the methods in 
order to determine the most suitable classifier. 

In order to get the best result, many studies have compared machine learning methods in order to 
choose the most efficient algorithms, such as in the case of the soil salinity approximation from hyperspectral 
data where they compared three PLSR algorithms, the SVM and deep learning techniques [16], and they 
found that the PLSR is the most suitable for this case. In another study where the objective was the 
estimation of forest parameters, they used the classification and regression tree (CART), the SVM, the ANN, 
and the random forest (RF). The RF algorithm turned out to be the most efficient [17]. In another case for the 
estimation of gas concentration based on an electronic nose, they used partial least squares (PLS) and SVM 
regression. SVM regression provided better generalization and precision [18]. Another study for the 
comparative evaluation and analysis of three machine learning algorithms in a controllable environment. 
They confirmed that basic propagation (BP) works better without being limited to severely polluted air 
conditioning but in addition to moderately polluted air conditioning compared to radial basis function (RBF) 
neural network and support vector regression (SVR) [19]. 

Mint is an aromatic herb widely consumed in the world and it is the subject of this study given the 
consumption rate of this vital substance either in cooking or as a therapeutic plant used in the preparation of 
medicines as well as in the beautifying products preparation, but the increasing demand for this herb drives 
the farmers to use insecticides to protect their harvests and to avoid the damage caused by the pests of this 
sensitive plant in order to meet the demand. The lifespan of insecticides product on plants can range from a 
week and sometimes up to a month, depending on the insecticides and the plant being treated, but nescience, 
voracity, and growing demand cause farmers to miss deadlines pre-registered. The scope of this research is 
mainly to compare the performance of the classification and regression algorithms, on the one hand, to 
differentiate the processed mint and the untreated one, and on the other hand to expect the treatment day with 
precision using different variations of machine learning algorithms. In our investigation we have employed 
for the classification: DT, naive bayes, SVM and ensemble classifier. For the regression: SVMR and PLSR. 
This paper is prepared in accordance with: the section “Materials and methods” depicts the mint used and 
sampling, the multi-sensor system, the data pre-processing, and the data analysis which we will present the 
machine learning methods used. The section “Results and discussion” presents the results of the different 
choice algorithms for classification and regression. 


2. METHODS AND PROCEDURES 
2.1. Mint used 

Samples of our hand-grown mint were used within the walls of the ENSA Khouribga, Morocco, and 
picked freshly after ripening. The first seedlings were purchased locally. The transplanted field is divided 
into two, first preserved part remains unprocessed and another part is treated with the insecticide product 
Malyphos 50 containing a dosage of 500 g/l of the dangerous substance malathion. a dosage of 4 ml of this 
product diluted in one litre of potable water was used. Malathion is a carcinogenic organophosphate 
neurotoxic compound [20] causing many cases of poisoning forbidden in several countries such as the 
European Union (decision 2007/389/EC). Its chemical formula is C1OH19O6PS2. 


2.2. Electronic olfactory system and sampling 

The multi-sensors system’s main idea is to imitate the principle of the human olfactory system 
operation, the block diagram in Figure 1 shows the sensing concept of the multi-sensors system. A 
multi-sensors system is typically composed of three principal parts: the sensor matrix, the data preprocessor, 
and the data interpretation. The odour sensor array is constituted of: the odour of samples and sensor array, 
the data pre-processor is made up of: raw signals and signal pre-processing, and finally, the data 
interpretation is constituted of: data analysis and processing which we can do using software with the 
pattern-recognition algorithm to arrive at the classification and the prediction. As shown in Figure 1, the 
multi-sensor system made in our research laboratory compose of a ventilator, a sample space, a sensor space, 
an acquisition card, and a laptop. The main component of this system is the sensor network, it is composed of 
seven metal oxide gas sensors listed in Table 1. 
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Laptop acquisition card 


Figure 1. Schematic representation of our multi-sensors system 


Table 1. Sensors network utilized in the multi-sensors system 
Sensor Target gas 
TGS 822 Vapours of organic solvents 
TGS 826 Ammonia, Air, Isobutane, Hydrogen and Ethanol 
TGS 2620 Volatile organic vapours 
TGS 4161 Carbon dioxide 
MQ-7 Carbon monoxide 
MQ- 136 Hydrogen sulphide, ammonia, Air and carbon monoxide 
MQ-214 Methane LPG, Air, Isobutane and propane 


2.3. Measurement process and data collection 

Mint samples weighing 5+0.1 g of each kind is placed in the sample chamber from which it 
undergoes an airflow coming from the ventilator for eight minutes, the airflow provided with the mint 
volatile organic compounds will then be channelled to the sensor chamber. The mint volatile organic 
compounds enter into a chemical reaction with the sensitive layer of each sensor for 8 minutes. Each sensor 
delivers a signal according to its sensitivity to the chemical compounds of the odour. After each 
measurement, the sensor array is exposed to ambient air for 10 minutes to purify it until the responses of the 
sensors are observed returned to their initial position. 

The sensors signals will be analyzed using pattern recognition algorithms and data processing 
techniques that play an essential role in making a decision. The sensors’ reaction is recorded utilizing an 
acquisition card namely ADLINK USB 1901 DAQ and a program on the labview software. Three datasets 
were extracted: 

X: the dataset of untreated mint. 

X2: the dataset of mint treated. 

Xt: the full dataset. 

The resulting matrix, so, is made up of three columns (three sensor responses) and seventy lines (7 numbers 
of daysx2 mint typesx5 samples). 


2.4. Data pre-processing 
After data collection, the raw signals have been managed to have signals referenced to zero for 
reading ease the responses by (1): 
_ Vin-Vo 


y, = 2 (1) 


Vo 


With Vo, Vm and V, are successively the starting voltage, the real voltage and the output relative voltage. 
Then, the signals will be observed to extract the pertinent characteristics (features) from the sensor responses. 
Normalization is applied to the features using the column normalization by dividing each column by its 
maximum value: 


X= (2) 
J Max (Xj) 
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Xij is the ith sample of the jth sensor, Xi comprises all the p rractions for the sensors of the ith sample [21]. 
The normalized features extracted from the recorded data will be used in the data classification and 
prediction thanks to the different algorithms of machine learning. Regarding the treated mint discrimination 
from untreated one, data classification algorithms were used. As for the mint treatment day prediction, 
regression algorithms were exploited. 


2.5. Classification methods 
2.5.1. Decision trees algorithm 

The DT is a supervised machine learning algorithm that can be utilized either for classification or 
regression problems [22]. It is a simple method that is founded on the prediction of the response by following 
decisions in the tree from the root node (start) to a leaf node. Each branch is linked to decision criteria. first, 
the model is formed from the training data, and then it will be used to predict the class of the data in the case 
of classification or the value of the variable in the case of regression. In our case, we search to discriminate 
the mint treated from untreated one that is why we used the Statistics and machine learning toolbox™ trees 
which are binary. Each step in a prediction takes into check the value of one predictor (variable). 


2.5.2. Naive Bayes classifier 

Naive Bayes classifier is a widespread supervised machine learning algorithm used for 
classification. The algorithm exploits bayes’ theorem which is founded on the prior knowing of the 
circumstances related to an happening that can describe the probability of this event occurring. Bayes 
theorem is defined as (3): 


P(X|Y) = PooP(y |X) (3) 


P (Y) 


where X and Y are independent. P(X|Y) is the probability of X when Y has already happened. P(X) and P(Y) 
are the probabilities of two independent X and Y. P(Y|X) is the chance of Y when X has already happened 
[23]. 


2.5.3. Support vector machines (SVM) 

SVM method is a supervised learning algorithm that classifies the data using the linear or non-linear 
kernel function, this function attempts to distinct classes by hyper-planes [24]. The hyperplane passing 
through the centre of the maximum distance separating two hyperplanes is the best result. Suppose we have 
scores separated by two planes P1 and P2 whose maximum distance, and the optimal hyperplane PO 
separating P1 and P2. PO is defined by (4): 


P: W.X+b=0 (4) 


where X is a point lying on the hyperplane, W is normal to the hyperplane, b is the bias. The minimization of 
W leads to the maximization of the margin. 


2.5.4. Ensemble classifier 

Ensemble classifiers are supervised learning algorithms whose purpose is to combine the predictions 
of several basic learners who are members of the set into a single output that often performs better on average 
than any other member of the set with uncorrelated error in the target data sets [25]. This leads to building a 
more precise classification decision. Newer algorithms contain error-correcting output coding, Bagging, and 
boosting but the Bayesian averaging remains the original ensemble method. 


2.6. Regression methods 
2.6.1. Partial least squares regression (PLSR) 

PLSR is a multivariate statistical supervised method that discoveries a linear regression model by 
projecting predicted and observable variables into a new space. The principle of PLSR is similar to that of 
principal components regression (PCR), is to extract the orthogonal factors from the latent variables (LVs) 
and to create the regression relationship among the dataset and the corresponding reference value [26]. For 
example, for input data matrices X and output Y, the PLS model will try to find the multidimensional 
direction in the X space that describes the direction of maximum multidimensional variance in the Y space 
[27]. the objective of this regression is to find a linear function such that: 


B=AB+C (5) 
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A=TP" +E (6) 

B = UQ" +F ) 


where A is an nxm matrix of predictors, B is an nxp matrix of responses; T and U are nxl matrices that are, 
respectively, projections of A (the A score, component or factor matrix) and projections of B (the B scores); 
P and Q are, respectively, mxl and pxl orthogonal loading matrices; and matrices E and F are the error terms, 
assumed to be independent and identically distributed random normal variables. The decompositions of A 
and B are made so as to maximize the covariance between T and U. 


2.6.2. Support vector machines regression (SVMR) 

Unlike the previously mentioned support vector machines which are used for classification, SVMR 
is utilized to predict values [28]. The regression learner application in statistics and machine learning 
toolbox ™ used for this study has a specific SVM regression algorithm named linear epsilon insensitive 
SVM (e-SVM). In the e-SVM regression, the training dataset contains predictor variables and observed 
response values. The aim is to find a function B which diverges from yn by a value not more than e for each 
learning point x, and at the same time as flat as possible. 

To get a linear function (B=AB+C) and confirm that it is as plane as possible, find B with the 
minimal norm value (BB). This is formulated as a convex optimization problem to reduce: 


J (B) =5*B'B (8) 


All residuals must have a value less than e. In the absence of a function B that can verify these constraints for 
all the points. This problem can be resolved by entering the slack variables ģn and €*n for each point: 


JB) =5* BB+ CYR En + EA) (9) 
Subject to: 
Wn: yn-(Xn B+b)<SE+én, (xn’B+b)—ynseté*n,€*n=0, En=z0 


The constant C is the box constraint, a positive numeric value that controls the penalty obligatory on 
observations that lie outside the epsilon margin (e) and helps to avoid overfitting (regularization). This value 
determines the trade-off between the flatness of B and the amount up to which deviations larger than € are 
tolerated. The parameters mean squared error (MSE) and correlation coefficient (R) were utilized to evaluate 
the model performance of each method [29]. The correlation coefficient (R) measures the relationship 
between the outputs and the targets, if the value R is equal to 1 it means that the correlation is close while 0 
means that the correlation is random. The mean squared error (MSE) is the average squared difference 
between outputs and targets. Lower values are better. 


R = es Mit (10) 
2 2 
Dies 2e 

MSE = 1/n Xi- (Xi — Y)? (11) 


Where X; is the experimental value and Y; is the forecasted value while n is the value number. 


3. RESULTS AND DISCUSSION 

The insecticide malathion was used to treat part of the mint field and another was reserved 
untreated. Afterwards, the mint samples were grown fresh every day throughout one week and but in the 
sample enclosure. The gas sensors response in the existence of the mint headspace was recorded. It is 
noteworthy that only three sensors responded well to the presence of mint, it is the sensors MQ136, TGS822, 
and TGS2611, and only their data that will be processed. 
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In the responses collected from the sensors as in Figure 2 during the seven days of experiments, the 
headspace circulated in the sample chamber exhibited a global behaviour of increasing voltage detected at the 
sensors output with a noteworthy variance between the reactions of the two types of samples (untreated mint, 
mint treated with Malyphos). It was noted that the sensors responses for the treated mint differ from those not 
treated at the level of the signal supreme value and of the surface occupied by the signal as well as of at the 
stabilized value. The remarkable change which happens between the sensors responses is surely due to the 
change in the volatile ingredients generated by each kind of mint. These selected features will be exploited 
subsequently using machine learning algorithms. 
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Figure 2. Example of responses collected from a sensor for one day 


3.1. Discrimination of treated mint from untreated one based on classification algorithms 

In this study, the classification learner app from statistics and machine learning toolbox in 
MATLAB 2019a was used. It is a toolbox that provides functions and applications for describing, analysing, 
and modelling data. To evaluate the model, a 5-folds cross-validation method was applied to avoid 
overfitting and achieve classification accuracy. Figure 3 shows the result obtained for four methods. 

In the decision tree results in Figure 3(a), a low success rate of 87.1% was obtained, nine 
misclassified scores, five from the untreated mint and four from the treated one. For the naive Bayes results 
in Figure 3(b), the success rate achieved is 90% with six scores of the untreated mint and one of the treated 
one are misclassified. For support vector machines as shown in Figure 3(c), it arrived at a success rate of 
91.4% with four scores of untreated mint, and two of treated are misclassified. Finally, for the ensemble 
classifier as shown in Figure 3(d), it arrived at the best results by achieving a success rate of 92.9% and just 
five misclassified scores two untreated and three treated. Table 2 summarizes the results obtained. 

These results illustrated that the ensemble classifier provides better performance against decision 
tree, naive Bayes, and support vector machines in the discrimination of mint kind. It should be noted that in 
our first study on mint with the same insecticide but just in the first four days, the said study was crowned by 
the first articles [30], [31], a hybrid method with the use of scores from the statistical method of principal 
component analysis in the SVM method was used and the result was a success rate of 87.50% and the results 
of this current study are very better than the first one. 


3.2. Treatment day prediction based on regression algorithms 

In the prediction case, only the dataset of the mint treated according to the sampling day was used, it 
is composed of 9 columns (3 characteristics*3 sensors) and 35 rows (5 samples taken*7 days). The prediction 
requires the use of regression algorithms, PLSR and SVMR are very well known and widely used methods. 
For the PLSR, the hold-out validation was used, the dataset has been subdivided into: 80% for the training, 
20% for the test, the three-fist components were selected. In the case of the SVMR, the choice was to use the 
5-folds cross-validation. 
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Figure 3. Different classification results: (a) A and A’ DT results, (b) B and B’ naive Bayes results, 
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Table 2. Methods and their success rate 


Method Success rate 
Decision tree 87.1% 
Naive Bayes 90% 

Support vector machines 91.4% 
Ensemble classifier 92.9% 


Figures 4(a) and 4(b) and Table 3 illustrates the results obtained by said algorithms. According to 
the figure and the table, PLSR did not succeed in predicting the days with great precision, the correlation 
coefficient is small, it is 0.67 with a big MSE greater than 2.0958. Whereas the SVMR attained good 
precision, the correlation coefficient is 0.82 higher than the correlation coefficient of PLSR with a mean 
square error (MSE) of 1.3615 much lower than that of PLSR. 

These results show the superiority of SVMR in predicting the day of mint treatment with malathion. 
To summarize this rich study, a multisensor system carried out with commercial gas sensors achieved good 
results in discriminating mint treated with malathion from untreated one using the ensemble classifier 
algorithms, and for the treatment day prediction, the SVMR is best suited given its results. 
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Figure 4. The results of the regression (a) using PLSR algorithm and (b) using SVMR algorithm 


Table 3. Methods and their correlation coefficient with the MSE 


Method Correlation coefficient (R) Mean square error (MSE) 
Patrial least squares regression (PLSR) 0.67 2.0958 
Support vector machines regression (SVMR) 0.82 1.3615 


4. CONCLUSION 

Chronic diseases and poisoning cases associated with the consumption of food contaminated with 
insecticides are constantly increasing, which requires vigilant monitoring. In the present paper, to analyse the 
mint headspace treated with the malathion insecticide, a multi-sensor system containing three commercial gas 
sensors has been manufactured and utilized. Firstly, four artificial intelligence (AI)-based classification 
methods namely DT, naive bayes, SVM, and ensemble classifier were inspected to distinguish the mint 
treated with malathion from the untreated one. The results demonstrated that the ensemble classifier reached 
the greatest result with a high success rate of 92.9% compared to the others. secondly, two regression 
methods were compared namely SVM regression and PLS regression for the treatment day prediction. 
Numerical results show that SVM regression provides a better correlation with a coefficient of about 0.82. 
The results of this study show that is possible to discriminate between the mint kinds (treated or not) and 
predicted the mint treatment day with good precision if the right machine learning algorithm was well 
adopted using just a simple, portable, and inexpensive multi-sensors system prototype designed in the 
laboratory. 
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